Efficiently and Accurately Comparing Real-valued Data Streams

نویسندگان

  • Paolo Capitani
  • Paolo Ciaccia
چکیده

Data streams are pervasive in many modern applications, and there is a pressing need to develop techniques for their efficient management. In this paper we consider real-valued streams and deal with the problem of reporting in real-time all the instants in which their distance falls below a given threshold. Current distance measures, such as Euclidean and Dynamic Time Warping (DTW ), either are inaccurate or are too time-consuming to be applied in a streaming environment. We propose SDTW , a novel DTW -like distance measure which can be continuously updated in constant time and experimentally show that it improves over DTW by orders of magnitude without sacrificing accuracy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integer Valued AR(1) with Geometric Innovations

The classical integer valued first-order autoregressive (INA- R(1)) model has been defined on the basis of Poisson innovations. This model has Poisson marginal distribution and is suitable for modeling equidispersed count data. In this paper, we introduce an modification of the INAR(1) model with geometric innovations (INARG(1)) for model- ing overdispersed count data. We discuss some structu...

متن کامل

Complex-Valued Data Envelopment Analysis

Data Envelopment Analysis (DEA) is a nonparametric approach for measuring the relative efficiency of a decision making units consists of multiple inputs and outputs. In all standard DEA models semi positive real valued measures are assumed, while in some real cases inputs and outputs may take complex valued. The question is related to measuring efficiency in such cases. As far as we are aware, ...

متن کامل

A Comparison of Single User Detection and Joint Detection for Centralized and Decentralized Code{Division Multiple{Access

|The achievable spectral eÆciencies attainable with code{division multiple{access when each user and the common receiver can employ multiple antennas are studied. Comparing single user detection and joint detection we give the optimum number of parallel data streams to be transmitted over these antennas in order to reach maximum spectral eÆciency. Besides, we show for single user detection and ...

متن کامل

An Improved Genetic Algorithm Based Complex-valued Encoding

Genetic algorithm is a useful tool to tackle optimization problems. In this paper the complex numbers were introduced into the traditional genetic algorithm, in which binary or real value data representation was used in the past, and a complex-value encoding genetic algorithm was proposed. Comparing with the conventional genetic algorithm which based on real-valued encoding or binary encoding, ...

متن کامل

Sketch ?-metric: Comparing Data Streams via Sketching RESEARCH REPORT

In this paper, we consider the problem of estimating the distance between any two large data streams in smallspace constraint. This problem is of utmost importance in data intensive monitoring applications where input streams are generated rapidly. These streams need to be processed on the fly and accurately to quickly determine any deviance from nominal behavior. We present a new metric, the S...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005